Two-level boosting classifiers ensemble based on feature selection for heart disease prediction
نویسندگان
چکیده
<span>Heart disease is a prevalent global health concern, necessitating early detection to save lives. Machine learning has revolutionized medical research, prompting the investigation of boosting algorithms for heart prediction. This study employs three datasets from University California Irvine (UCI) repository: Cleveland, Statlog, and Long Beach, with 14 features each. Recursive feature elimination support vector machine (SVM) utilized identify significant features. Five (gradient algorithm (GB), adaptive (AdaBoost), extreme gradient (XGBoost), cat boost (CatBoost) light (LightGBM)) are integrated into an ensemble model achieve best classification performance. The proposed demonstrates superior accuracy, precision, recall, f-measure, area under curve (AUC) compared individual models, achieving 93.44%, 83.33%, 79.75% accuracies Beach datasets. approach offers accurate efficient method prediction, which crucial clinical decision-making management.</span>
منابع مشابه
An Ensemble of Classifiers with Genetic Algorithm Based Feature Selection
Different data classification algorithms have been developed and applied in various areas to analyze and extract valuable information and patterns from large datasets with noise and missing values. However, none of them could consistently perform well over all datasets. To this end, ensemble methods have been suggested as the promising measures. This paper proposes a novel hybrid algorithm, whi...
متن کاملHybrid Correlation and Causal Feature Selection for Ensemble Classifiers
PC and TPDA algorithms are robust and well known prototype algorithms, incorporating constraint-based approaches for causal discovery. However, both algorithms cannot scale up to deal with high dimensional data, that is more than few hundred features. This chapter presents hybrid correlation and causal feature selection for ensemble classifiers to deal with this problem. Redundant features are ...
متن کاملFeature Selection using Distributed Ensemble Classifiers for Very Large Datasets
Datasets are becoming larger and there is an acute need to use data mining techniques to exploit the available data. The increasing size of the datasets poses a challenge to the data miners, which can be solved using two approaches – high speed computing systems, and pre-processing techniques. In this paper, we propose a solution combining the above two techniques using a distributed feature se...
متن کاملTwin Boosting: improved feature selection and prediction
We propose Twin Boosting which has much better feature selection behavior than boosting, particularly with respect to reducing the number of false positives (falsely selected features). In addition, for cases with a few important effective and many noise features, Twin Boosting also substantially improves the predictive accuracy of boosting. Twin Boosting is as general and generic as boosting. ...
متن کاملImproving Classifications for Cardiac Autonomic Neuropathy Using Multi-level Ensemble Classifiers and Feature Selection Based on Random Forest
This paper is devoted to empirical investigation of novel multi-level ensemble meta classifiers for the detection and monitoring of progression of cardiac autonomic neuropathy, CAN, in diabetes patients. Our experiments relied on an extensive database and concentrated on ensembles of ensembles, or multi-level meta classifiers, for the classification of cardiac autonomic neuropathy progression. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Indonesian Journal of Electrical Engineering and Computer Science
سال: 2023
ISSN: ['2502-4752', '2502-4760']
DOI: https://doi.org/10.11591/ijeecs.v32.i1.pp381-391